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Introducing Variety in Risk Management 
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"Yesterday the S&P500 went up by 3%". Is this number telhng all the story 
if half the stocks went up 5% and half went down 1%? Surely one can do a little 
better and give two figures, the average and the dispersion around this average, 
that two of us have recently christened the variety |jl|]. 

c^ ■ Call Tiit) the return of asset i on day t. The variety V(t) is simply the root 

Vh ■ mean square of the stock returns on a given day: 

g: v\t) = -Y:{n{t) - r^{t))\ (1) 

where A^ is the number of stocks and r^ = {^/N)Yl,i^i is the market average. 
^ \ If the variety is, say, 0.1%, then most stocks have indeed made between 2.9% 

(^ ■ and 3.1%. But if the variety is 10%, then stocks followed rather different trends 

CN . during the day and their average happened to be positive, but this is just an 

Q ! average information. 

The variety is not the volatility of the index. The volatility refers to the 

amplitude of the fluctuations of the index from one day to the next, not the 
c^ ■ dispersion of the result between different stocks. Consider a day where the market 

has gone down 5% with a variety of 0.1% - that is, all stocks have gone down 
,^ . by nearly 5%. This is a very volatile day, but with a low variety. Note that low 

^H ! variety means that it is hard to diversify: all stocks behave the same way. 

X \ The intuition is however that there should be a correlation between volatility 

and variety, probably a positive one: when the market makes big swings, stocks 

are expected to be all over the place. This is actually true. Indeed the correlation 

coefficient between V(t) and \rm\ is 0.68.Q The variety is, on average, larger when 
5^ \ the amplitude of the market return is larger (see the discussion below). Very 

much like the volatility, the variety is correlated in time: there are long periods 

where the market volatility is high and where the market variety is high (see Fig. 

1). Technically, the temporal correlation function of these two objects reveal a 

similar slow (power-law like) decay with time. 

"'^This value is not an artifact due to outliers. In fact an estimation of the Spearman rank- 
order correlation coefficient gives the value of 0.37 with a significance level of 10"'^^. 



Figure 1 about here 



A theoretical relation between variety and market average return can be ob- 
tained within the framework of the one-factor model, that suggests that the vari- 
ety increases when the market volatility increases. The one-factor model assumes 
that ri{t) can be written as: 

r,(t) = a, + A7^„(t) + Q(t), (2) 

where cij is the expected value of the component of security i's return that is 
independent of the market's performance (this parameter usually plays a minor 
role and we shall neglect it), /3j is a coefficient usually close to unity that we will 
assume to be time independent, IZmif) is the market factor and ej(t) is called 
the idiosyncratic return, by construction uncorrelated both with the market and 
with other idiosyncratic factors. Note that in the standard one-factor model the 
distributions of TZm and e, are chosen to be Gaussian with constant variances, we 
do not make this assumption and let these distributions be completely general 
including possible volatility fluctuations. 

In the study of the properties of the one-factor model it is useful to consider 
the variety v{t) of idiosyncratic part, defined as 

1 ^ 
v'it) = ^Y.Ut)f. (3) 

Under the above assumptions the relation between the variety and the market 
average return is well approximated by (see Box 1 for details): 

V\t)^v\t)+^p\l{t), (4) 

where A/3^ is the variance of the /5's divided by the square of their mean. 

Therefore, even if the idiosyncratic variety v is constant, Eq. (4) predicts 
an increase of the volatility with r^, which is a proxy of the market volatility. 
Because A/9^ is small, however, this increase is rather small. As we shall now 
discuss, the effect is enhanced by the fact that v itself increases with the market 
volatility. 

In its simplest version, the one-factor model assumes that the idiosyncratic 
part ej is independent of the market return. In this case, the variety of idiosyn- 
cratic terms v{t) is constant in time and independent from r^. In Fig. 2 we 
show the variety of idiosyncratic terms as a function of the market return. In 
contrast with these predictions, the empirical results show that a significant corre- 
lation between v{t) and rj„(t) indeed exists. The degree of correlation is different 
for positive and negative values of the market average. In fact, the best linear 
least-squares fit between v{t) and rm{t) provides different slopes when the fit 



is performed for positive (slope +0.55) or negative (slope —0.30) value of the 
market average. We have again checked that these slopes are not governed by 
outliers by repeating the fitting procedure in a robust way. The best fits obtained 
with this procedure are shown in Fig. 2 as dashed lines. The slopes of the two 
lines are -0.25 and 0.51 for negative and positive value of the market average, 
respectively. Therefore, from Eq.(4) we find that the increase of variety in highly 
volatile periods is stronger than what is expected from the simplest one-factor 
model, although not as strong for negative (crashes) than it is for positive (rally) 
days. By analyzing the three largest crashes occurred at the NYSE in the period 
from January 1987 to December 1998, we observe two characteristics of the vari- 
ety which are recurrent during the investigated crashes: (i) the variety increases 
substantially starting from the crash day and remains at a level higher than typ- 
ical for a period of time of the order of sixty trading days; (ii) the highest value 
of the variety is observed the trading day immediately after the crash. 



Figure 2 about here 



An important quantity for risk management purposes is the degree of correla- 
tion between stocks. If this correlation is too high, diversification of risk becomes 
very difficult to achieve. A natural way 0] to characterize the average correlation 
between all stocks i,j on a given day C(t) is to define the following quantity: 

As shown in the technical Box 1, to a good approximation one finds: 

As mentioned above, the variety of the idiosyncratic terms is constant in time 
in the simplest one-factor model. The correlation structure in this version of the 
one-factor model is very simple and time independent. Still, the quantity C, taken 
for a proxy of the correlations on a given day, increases with the 'volatility' r^, 
simply because F decreases. As shown in Fig. 3, the simplest one factor model 
in fact overestimates this increase 0]. Because the idiosyncratic variety f^(t) 
tends to increase when \rm\ increases (see Figure 2), the quantity F{t) is in fact 
larger and C is smaller. This may suggest that, at odds with the common lore, 
correlations actually are less effective than expected using a one-factor model in 
high volatility periods: the unexpected increase of variety gives an additional 
opportunity for diversification. Other, more subtle indicators of correlations, like 
the exceedance correlation function defined in Box 2 and shown in Fig. 4 (see 



[0), actually confirm that the comnionly reported increase of correlations during 
highly volatile bear periods might only reflect the inadequacy of the indicators 
that are used to measure them. 



Figures 3 and 4 about here 



Therefore, the idiosyncrasies are by construction uncorrelated, but not in- 
dependent of the market. This shows up in the variety, does it also appear in 
different quantities? We have proposed above to add to the market return the 
variety as a second indicator. One can probably handle a third one, which gives 
a refined information of what happened in the market on a particular day. The 
natural question is indeed: what fraction / of stocks did actually better than the 
market? A balanced market would have / = 50%. If / is larger than 50%, then 
the majority of the stocks beat the market, but a few ones lagging behind rather 
badly, and vice versa. A closely related measure is the asymmetry A, defined as 
A{t) = rm{t) — r*{t), where the median r* is, by definition, the return such that 
50% of the stocks are above, 50% below. If / is larger than 50%, then the median 
is larger than the average, and vice versa. Is the asymmetry A also correlated 
with the market factor? Fig. 5 shows that it is indeed the case: large positive 
days show a positive skewness in the distribution of returns - that is, a few stocks 
do exceptionally well - whereas large negative days show the opposite behaviour. 
In the figure each day is represented by a circle and all the circles cluster in 
a pattern which has a sigmoidal shape. The asymmetrical behaviour observed 
during two extreme market events is shown in the insets of Fig. 5 where we 
present the probability density function of returns observed in the most extreme 
trading days of the period investigated in Ref. 0|. This empirical observation 
cannot be explained by a one-factor model. This has been shown by two different 
approaches: (i) by comparing empirical results with surrogate data generated by 
a one-factor model |^ and (ii) by considering directly the asymmetry of daily 
idiosyncrasies [0]. Intuitively, one possible explanation of this anomalous skew- 
ness (and a corresponding increase of variety) might be related to the existence 
of sectors which strongly separate from each other during volatile days. 



Figure 5 about here 



The above remarks on the dynamics of stocks seen as a population are im- 
portant for risk control, in particular for option books, and for long-short equity 
trading programs. The variety is in these cases almost as important to monitor 
as the volatility. Since this quantity has a very intuitive interpretation and an 



unambiguous definition (given by Eq. (1)), this could become a liquid financial 
instrument which may be used to hedge market neutral positions. Indeed, market 
neutrality is usually insured for 'typical' days, but is destroyed in high variety 
days. Buying the variety would in this case reduce the risk of these approximate 
market neutral portfolios. 
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1 Technical Box 1: Proof of Eqs. (4) and (6) 

Here we show that if the number of stocks A^ is large, then up to terms of order 
l/v^iV, Eqs. d) and (|) indeed hold. We start from Eq. (g) with a^ = 0. 
Summing over i = 1, ..,N this equation, we find: 

Since for a given t the idiosyncratic factors are uncorrelated from stock to stock, 
the second term on the right hand side is of order 1/viV, and can thus be ne- 
glected in a first approximation giving 

r™(t)~;^7^„,(t), (8) 

where /? = J2f=iPi/N. In order to obtain Eq. (4), we square Eq. (2) and 
summing over i = 1, ..,N, we find: 

T N 1 N T N 1 N 

-Erf(t) = 7^^(t)-E A' + ^^E^'W + 2 7e^W^EAQ(t) (9) 

i=l i=l i=l 2=1 

Under the assumption that ei(t) and Pi are uncorrelated the last term can be 
neglected and the variety V(t) defined in Eq. (1) is given by 

v'{t)^v\t) + (^-f)nl{t) (10) 

This is the relation between V(t) and the market factor TZm{t). By inserting Eq. 
(8) in the previous equation one obtains Eq. (4). 



Now consider Eq. (|]). Using the fact that YliLiri/N = Vm, we find that 
the numerator is equal to r^ up to terms of order 1/A^. Inserting Eq. (Q) in 
the denominator and again neglecting the cross-product terms that are of order 
1/VN, we find: 

cw^ ^^:W .,.. ^-^. m^m. (11) 



iJ'TlUt) + vHt) X + F(ty ri(ty 



,-?& 



where x = fP j (5 . This quantity is empirically found to be ~ 1.05 for the S&P 
500, and we have therefore replaced it by 1 in Eq. (^. 

2 Box 2: Exceedance correlations 

In order to test the structure of the cross-correlations during highly volatile peri- 
ods, Longin and Solnik (0|) have proposed to study the 'exceedance correlation', 
defined for for a given pair ij of stocks as follows: 

where the subscript > 6 means that both normalized returns are larger than 
6, and fj are normalized centered returns. The negative exceedance correlation 
Pij{0) is defined similarly, the conditioning being now on returns smaller than 
9. We have plotted the average over all pairs of stocks p'^{9) for positive 9 and 
P-{9) for negative 9, both for empirical data and for surrogate data generated 
according to a non- Gaussian one factor model Eq. (^, where both the market 
factor and the idiosyncratic factors have fat tails compatible with empirical data 
0. Note that empirical exceedance correlations grow with \9\ and are strongly 
asymmetric. For a Gaussian model, p^{9) would have a symmetric tent shape, 
i.e. it would decrease with \9\\ 

In conclusion, most of the downside exceedance correlations seen in Fig. 4 
can be explained if one factors in properly the fat tails of the unconditional 
distributions of stock returns and the skewness of the index [0, and does not 
require a specific correlation increase mechanism. 
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Figure 1: Time evolution of the daily variety V(t) of 1071 NYSE stocks continu- 
ously traded from January 1987 to December 1998. The time evolution presents 
slow dynamics and several bursts. The highest peak is observed at and imme- 
diately after the Black Monday. The highest value corresponds to the day after 
Black Monday. In the inset we show the autocorrelation function (ACF) of V(t). 
The autocorrelation has a slow decay in time and is still as high as 0.15 after 100 
trading days. 
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Figure 2: Daily variety v of idiosyncratic terms of the one-factor model (Eq. (3)) 
as a function of the market average r.^ of the 1071 NYSE stocks continuously 
traded from January 1987 to December 1998. Each circle refers to one trading 
day of the investigated period. In the main panel we show the trading days with 
Tm belonging to the interval from —0.05 to 0.05, whereas in the inset we show the 
whole data set including five most extreme days. The two solid lines are linear 
fits over all days of positive (right line) and negative (left line) market average. 
The slope of the two lines are +0.55 ± 0.02 (right) and -0.30 ± 0.02 (left). The 
tick distance in the ordinate of the inset is equal to the one of the main panel. 
The two dashed lines are linear fits obtained with a robust local M-estimates 
minimizing the absolute deviation. The slope of the two lines are +0.51 (right) 
and —0.25 (left). These values are quite close to the previously obtained ones, 
showing that the role of outliers is minor. 
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Figure 3: Correlation measure C conditional to the absolute market return to be 
larger than Ir^l, both for the empirical data and for surrogate data using a (non 
Gaussian) one- factor model 0. Note that both show a similar apparent increase 
of correlations with \rm\- This effect is actually overestimated by the one-factor 
model with fixed residual volatilities. \rm\ is in percents. 
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Figure 4: Average exceedance correlation functions between all pair of stocks as 
a function of the level parameter 6, both for real data and the surrogate non 
Gaussian one-factor model. 
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Figure 5: Daily asymmetry A of the probability density function of daily returns 
of a set of 1071 NYSE stocks continuously traded from January 1987 to December 
1998 as a function of the market average r^- Each circle refers to one trading day 
of the investigated period. In the insets we show the probability density function 
of daily returns observed for the two most extreme market days of the period 
investigated. Specifically, the left inset refers to Black Monday (October 19th, 
1987) and the right inset refers to October 21st, 1987. The negative (left inset) 
and positive (right inset) skewness of the distribution is clearly seen in both cases. 
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